PyroClean: Denoising Pyrosequences from Protein-Coding Amplicons for the Recovery of Interspecific and Intraspecific Genetic Variation

نویسندگان

  • Ricardo Ramirez-Gonzalez
  • Douglas W. Yu
  • Catharine Bruce
  • Darren Heavens
  • Mario Caccamo
  • Brent C. Emerson
چکیده

High-throughput parallel sequencing is a powerful tool for the quantification of microbial diversity through the amplification of nuclear ribosomal gene regions. Recent work has extended this approach to the quantification of diversity within otherwise difficult-to-study metazoan groups. However, nuclear ribosomal genes present both analytical challenges and practical limitations that are a consequence of the mutational properties of nuclear ribosomal genes. Here we exploit useful properties of protein-coding genes for cross-species amplification and denoising of 454 flowgrams. We first use experimental mixtures of species from the class Collembola to amplify and pyrosequence the 5' region of the COI barcode, and we implement a new algorithm called PyroClean for the denoising of Roche GS FLX pyrosequences. Using parameter values from the analysis of experimental mixtures, we then analyse two communities sampled from field sites on the island of Tenerife. Cross-species amplification success of target mitochondrial sequences in experimental species mixtures is high; however, there is little relationship between template DNA concentrations and pyrosequencing read abundance. Homopolymer error correction and filtering against a consensus reference sequence reduced the volume of unique sequences to approximately 5% of the original unique raw reads. Filtering of remaining non-target sequences attributed to PCR error, sequencing error, or numts further reduced unique sequence volume to 0.8% of the original raw reads. PyroClean reduces or eliminates the need for an additional, time-consuming step to cluster reads into Operational Taxonomic Units, which facilitates the detection of intraspecific DNA sequence variation. PyroCleaned sequence data from field sites in Tenerife demonstrate the utility of our approach for quantifying evolutionary diversity and its spatial structure. Comparison of our sequence data to public databases reveals that we are able to successfully recover both interspecific and intraspecific sequence diversity.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Molecular differentiation of sheep and cattle isolates of Fasciola hepatica using RAPD-PCR

Understanding genetic structure and status of genetic variation of Fasciola hepatica isolates from different hosts, has important implications on epidemiology and effective control of fasciolosis. Random amplified polymorphic DNA (RAPD-PCR) was used to study the genetic variation of F. hepatica in sheep and cattle. DNA was extracted from adult helminthes removed from livers of each infected ani...

متن کامل

Appraisal of the entire mitochondrial genome for DNA barcoding in birds

DNA barcoding based on a standardized region of 648 base pairs of mitochondrial DNAsequences from Cytochrome C Oxidase 1 (COX1) is proposed for animal species identification.Recent studies suggested that DNA barcoding has been effective for identifying 94% of birdspecies. The proposed threshold of 10 times the average intraspecific variation could be used forthe identification and delimitation ...

متن کامل

Intraspecific Variation in Leishmania major Isolated from Different Forms of Zoonotic Cutaneous Leishmaniasis

Background: Zoonotic cutaneous leishmaniasis (ZCL) is a polymorphic disease which may show various clinical manifestations. Although genetic variability of the parasite is suggested to be one of the factors influencing clinical manifestations in leishmaniasis, no data exist regarding genetic polymorphism of Leishmania major. Therefore, determination of genetic variation within the species of L....

متن کامل

A study on genetic differentiation in two species of Iranian bleaks, (Alburnus mossulensis) and (Alburnus caeruleus) (Teleostei, Cyprinidae) using simple sequence repeats

The genetic structure of the genus Alburnus is not well known and the phylogenetic relationships among its species are uncertain. In the present study, simple sequence repeats (SSRs or microsatellites) were used to evaluate genetic diversity and genetic differentiation between Alburnus mossulensis Heckel, 1843 from Kashgan River in Lorestan province and Alburnus caeruleus Heckel, 1843 from Gama...

متن کامل

Identification of Drought Tolerant Lines from Interspecific Hybridization in Two Different Genetic Backgrounds of Barley under Different Irrigation Regimes

Domestication and artificial selection have reduced the level of genetic variation in barley. Inter-specific hybridization is one of the most valuable ways to restore at least part of the lost variation. This study aimed to investigate genetic diversity and screening barley lines which possessed the desired traits, as well as drought tolerance, within two F3 populations derived by crossing a cu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2013